Introducing DOTUR, a computer program for defining operational taxonomic units and estimating species richness.

Authors

  • Patrick D Schloss
  • Jo Handelsman
Abstract

Although copious qualitative information describes the members of the diverse microbial communities on Earth, statistical approaches for quantifying and comparing the numbers and compositions of lineages in communities are lacking. We present a method that addresses the challenge of assigning sequences to operational taxonomic units (OTUs) based on the genetic distances between sequences. We developed a computer program, DOTUR, which assigns sequences to OTUs by using either the furthest, average, or nearest neighbor algorithm for each distance level. DOTUR uses the frequency at which each OTU is observed to construct rarefaction and collector's curves for various measures of richness and diversity. We analyzed 16S rRNA gene libraries derived from Scottish and Amazonian soils and the Sargasso Sea with DOTUR, which assigned sequences to OTUs rapidly and reliably based on the genetic distances between sequences and identified previous inconsistencies and errors in assigning sequences to OTUs. An analysis of the two 16S rRNA gene libraries from soil demonstrated that they do not contain enough sequences to support a claim that they contain different numbers of bacterial lineages with statistical confidence (P > 0.05), nor do they contain enough sequences to provide a robust estimate of species richness when an OTU is defined as containing sequences that are no more than 3% different from each other. In contrast, the richness of OTUs at the 3% level in the Sargasso Sea collection began to plateau after the sampling of 690 sequences. We anticipate that an equivalent extent of sampling for soil would require sampling more than 10,000 sequences, almost 100 times the size of typical sequence collections obtained from soil.
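The two steps the abstract describes, grouping sequences into OTUs from pairwise genetic distances and then building a rarefaction curve from OTU frequencies, can be illustrated with a minimal Python sketch. This is not DOTUR's actual code: the function names `cluster_otus` and `rarefy`, the toy distance matrix, and the use of the standard analytic rarefaction formula are all assumptions made for illustration.

```python
from math import comb


def cluster_otus(dist, cutoff, method="furthest"):
    """Group sequences into OTUs by agglomerative clustering (hypothetical helper).

    dist   : symmetric matrix (list of lists) of pairwise genetic distances
    cutoff : merge clusters only while their linkage distance is <= cutoff
    method : "furthest", "average", or "nearest" neighbor linkage
    """
    def linkage(a, b):
        pairs = [dist[i][j] for i in a for j in b]
        if method == "furthest":
            return max(pairs)
        if method == "nearest":
            return min(pairs)
        if method == "average":
            return sum(pairs) / len(pairs)
        raise ValueError(f"unknown method: {method}")

    clusters = [[i] for i in range(len(dist))]
    while len(clusters) > 1:
        # Find the closest pair of clusters under the chosen linkage.
        d, i, j = min(
            (linkage(clusters[i], clusters[j]), i, j)
            for i in range(len(clusters))
            for j in range(i + 1, len(clusters))
        )
        if d > cutoff:
            break  # no remaining pair is close enough to merge
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters


def rarefy(otu_sizes, n):
    """Expected number of OTUs seen in a random subsample of n sequences.

    Uses the analytic rarefaction formula (an assumption of this sketch):
    E[S_n] = sum_i (1 - C(N - N_i, n) / C(N, n)).
    """
    total = sum(otu_sizes)
    return sum(1 - comb(total - s, n) / comb(total, n) for s in otu_sizes)


# Toy distance matrix for five made-up sequences (illustrative values only).
DIST = [
    [0.00, 0.01, 0.02, 0.20, 0.20],
    [0.01, 0.00, 0.05, 0.20, 0.20],
    [0.02, 0.05, 0.00, 0.20, 0.20],
    [0.20, 0.20, 0.20, 0.00, 0.02],
    [0.20, 0.20, 0.20, 0.02, 0.00],
]

if __name__ == "__main__":
    for method in ("furthest", "average", "nearest"):
        otus = cluster_otus(DIST, cutoff=0.03, method=method)
        sizes = [len(o) for o in otus]
        curve = [round(rarefy(sizes, n), 2) for n in range(1, len(DIST) + 1)]
        print(method, otus, curve)
```

At a 3% cutoff the furthest neighbor rule keeps sequence 2 in its own OTU because it is 5% from sequence 1, whereas the nearest neighbor rule merges it with sequences 0 and 1 through its 2% distance to sequence 0. The choice of linkage therefore changes the OTU counts that feed the richness estimates.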

Similar resources

Defining DNA-based operational taxonomic units for microbial-eukaryote ecology.

DNA sequence information has increasingly been used in ecological research on microbial eukaryotes. Sequence-based approaches have included studies of the total diversity of selected ecosystems, studies of the autecology of ecologically relevant species, and identification and enumeration of species of interest for human health. It is still uncommon, however, to delineate protistan species base...

Introducing SONS, a tool for operational taxonomic unit-based comparisons of microbial community memberships and structures.

The recent advent of tools enabling statistical inferences to be drawn from comparisons of microbial communities has enabled the focus of microbial ecology to move from characterizing biodiversity to describing the distribution of that biodiversity. Although statistical tools have been developed to compare community structures across a phylogenetic tree, we lack tools to compare the memberships...

Divergence thresholds and divergent biodiversity estimates: can metabarcoding reliably describe zooplankton communities?

DNA metabarcoding is a promising method for describing communities and estimating biodiversity. This approach uses high-throughput sequencing of targeted markers to identify species in a complex sample. By convention, sequences are clustered at a predefined sequence divergence threshold (often 3%) into operational taxonomic units (OTUs) that serve as a proxy for species. However, variable level...

NUMERICAL TAXONOMIC STUDY OF THE IRANIAN SPECIES OF ALYSSUM L. BASED ON MORPHOLOGICAL CHARACTERS

The genus Alyssum L. belongs to the subtribe Alyssinae, tribe Alysseae and family Cruciferae (Brassicaceae). This genus is one of the largest genera of the family of Cruciferae in Iran, and seems to be the most problematic genus in which the boundary of certain species is not completely clear due to the polymorphism of morphological characters. The main objective of this research is to stud...

On defining and quantifying biotic homogenization

Ongoing species invasions and extinctions are changing biological diversity in different ways at different spatial scales. Biotic homogenization (or BH) refers to the process by which the genetic, taxonomic or functional similarities of regional biotas increase over time. It is a multifaceted process that encompasses species invasions, extinctions and environmental alterations, focusing on how ...

Journal:
  • Applied and Environmental Microbiology

Volume 71, Issue 3

Pages  -

Publication date: 2005